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1.  INTRODUCTION 


BACKGROUND 


AND 


In  a  two-level  (±1)  Random  Balance  (RB)  design  for  studying  K 
factors,  each  column  of  the  NxK  design  matrix  consists  of  N/2  +l*s 
and  N/2  -l*s  where  N  (an  even  number)  denotes  the  total  number  of 
observations.  In  each  design  column  the  +lfs  and  -l*s  are  assigned 
randomly,  making  all  possible  combinations  of  N/2  +1*8  and  N/2  -l’s 
(there  are  all)  equally  likely,  with  each  column  receiving 

an  independent  randomization.  The  RB  method  is  discussed  at  length 
by  Satterthwaite  (1959),  Budne  (1959a,  1959b,  1959c),  Anscombe  (1959), 
and  Youden,  et.al.  (1959).  It  has  been  suggested  that  the  RB  concept 
has  utility  in  the  design  of  supersaturated  (i.e.,  N  <  K)  screening 
experiments  in  which  the  researcher  attempts  to  identify,  in  a  severely 
limited  number  of  test  runs,  the  most  Important  factors  out  of  a  large 
number  of  possible  contributing  factors. 

A  major  concern  with  RB  experimental  design  is  that  there  are  no 
specific  analysis  techniques  for  these  designs.  Satterthwaite  (1959;  p.  126) 
remarks  that  practically  any  technique  used  to  analyze  data  without  RB 
properties  can  be  applied  in  a  RB  design  to  analyze  any  (suitably  small) 
subset  of  factors,  ignoring  all  other  factors.  The  simplest  approach, 
then,  would  be  to  consider  each  factor  separately  and  apply  some  standard 
parametric  or  nonparametrlc  test  of  significance.  Anscombe  (1959)  has 
suggested  that  one  might  make  a  randomization  test,  following  Welch  (1938) 
or  Tukey  (1959),  or  a  normal-theory  F-test. 

In  a  recent  paper,  Mauro  and  Smith  (1982)  considered  the  use  of  a 
standard  F-test  applied  separately  to  each  factor  as  the  method  of  analysis 
for  RB  designs.  To  approximate  power,  the  usual  normal-theory  assumptions 
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were  made.  It  was  found  that  power  calculated  under  normal  theory 
agreed  very  closely  with  corresponding  Monte  Carlo  estimates,  even  for 
relatively  small  values  of  N.  Type- I  error  probabilities  were  also 
found  to  be  in  very  close  agreement  with  nominal  a  levels.  The  overall 
extent  of  this  agreement  was  slightly  unexpected  because  it  was  known 
that  the  model  error  terms  did  not  satisfy  the  normal-theory  assumptions 
of  joint  normality  and  independence.  In  particular,  Scheff^  (1959; 
pp.  331-369)  has  indicated  that  correlation  in  the  observations  can  have 
a  peculiar  and  often  serious  effect  on  inferences  about  means.  It  is 
curious,  therefore,  why  these  violations  had  such  little  effect  in  the 
RB  model. 

The  purpose  of  this  short  note  is  to  give  a  simple  explanation  for 
the  somewhat  surprising  results  obtained  by  Mauro  and  Smith  (1982)  and 
to  give  an  improved  approximation  with  which  to  calculate  power  probabilities. 
In  our  approach,  we  observe  that  the  RB  model  (when  analyzed  with  in¬ 
dividual  F-tests)  has  exchangeably  distributed  error  terms  and  we  make 
use  of  results  derived  by  Arnold  (1979,  1981)  for  the  exchangeable  linear 
model  (Elil).  Our  discussion  provides  an  interesting  application  of  the 
ELM  and  also  serves  to  explore  the  theory  on  which  RB  experimentation  is 


based 


2.  THE  EXCHANGEABLE  LINEAR  MODEL 


The  random  variables  e^,  e^*  •  ••»  e^  are  said  to  be  exchangeably 

distributed  if  the  joint  distribution  of  e  . ,  e  . . e  is  the 

“ — — 7T 1  7T2  TTr 

same  as  the  joint  distribution  of  e^»  e^*  •  •.,  e^  for  all  permutations 
it  of  (1,  2,  ...»  r)  • 

In  the  ordinary  linear  model  (OLM)  we  assume  that  the  error  terms 
are  l.l.d.  normal  random  variables*  In  the  ELM  we  assume  exchangeably 
normally  distributed  errors.  Following  Arnold  (1981;  pp.  232-238),  the 
ELM  is  equivalently  the  model  in  which  we  observe  Y  -  Nr(±U  °2  A(p)), 
where  y  is  an  rxl  mean  vector  and  A(p)  has  the  following  form 


A(p)  - 


1  p  *  *  .  p 
pi  •  •  •  p 


P  P 


A  key  result  derived  by  Arnold  (1981)  is  that  in  an  ELM  one-way 
analysis  of  variance,  equality  of  level  means  can  be  validly  tested  with 
the  usual  F-tests  used  in  the  OLM  (i.e.,  letting  p  -  0  in  the  ELM).  We 
may  also  note  that  the  ELM  is  simply  a  repeated  measures  model  with  only 
one  individual. 
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3.  THE  RB  MODEL 


In  the  analysis  of  RB  data,  let  us  assume  the  first-order  linear 


model 


yi '  6» +  ji6Jx« +  E‘ 


.th 


Where  is  the  i  observation  (i  -  1,  2,  . ...  N),  x^  is  the  level 

t*V» 

(±1)  of  the  j —  factor  for  the  1 —  observation,  8^  is  the  (linear) 
til 

effect  of  the  j —  factor,  and  the  are  i.i.d.  N(0,  02)  random  dis- 

2  til 

turbances,  a ^  unknown.  Recall  that  in  an  RB  experiment,  the  j —  design 

column  Xj  *  ^X1  j  *  x2j*  ••••  *s  an  Nx*  vector  consisting  of  a  ran¬ 

dom  arrangement  of  N/2  +lfs  and  N/2  -lfs.  By  construction,  the  K  column 
vectors  of  the  design  matrix  X  -  (xj,  x^»  .  x^)  are  independent.  Fur¬ 
ther,  we  assume  that  X  and  £  -  (e^,  ...,  e^) f  are  independent. 

Suppose  we  wish  to  test  the  hypothesis  H^:  8^  *0  versus  H^sf^  4  0 

with  a  simple  F-test  (or,  equivalently,  a  two-sample  t-test)  applied  to 
the  observations  at  the  high  (+1)  and  low  (-1)  levels  of  the  j  —  factor. 
To  simplify  notation,  suppose  we  use  as  prototype  j  ■  1.  Further, 
without  loss  of  generality,  assume  the  observations  are  indexed  so  that 
(y^;  i  £  N/2}  have  x^j  ■  +1  and  {y^;  i  >N/2}  have  x^  -  -1.  Let  Y  - 

<V  y2 . yNr  * 

Thu.,  for  i  <  N/2  we  have  y^  *  8q  +  8j  +  e^,  an<*  ^or  *  >  we 

have  y  »  8n  -  8,  +  e  where  e.  ■  £  B4x  +  £4.  It  is  easy  to  show  that 
iuii  1  j.2  3  13  1 

Y  has  mean  vector  jj  given  by 

D  *  (80  ^  •••»  8q  ^  8)#  8q  —  8ji  •••»  8q  *  8^)' 

and  variance-covariance  matrix  l  «  (y2  +  o2)A(p)  where  y2  ■  E,  62  and 

^  J  J 


P  *  -Y2/(N-1)(y2  +  o*).  We  see*  therefore,  that  this  model  has  the 

same  covariance  structure  as  the  ELM.  Moreover,  this  correspondence 
is  independent  of  the  sample  size.  The  only  difference  between  these 
two  models  is  that  the  errors  ( e in  the  RB  model  are  not  precisely 
joint  normal. 

We  suspect,  however,  that  this  violation  has  little  effect  on  the 

F-test  for  two  reasons:  (1)  Arnold  (1980)  has  demonstrated  asymptotic 

validity  against  nonnormality  for  tests  of  this  type  for  the  repeated 

measures  model,  of  which  the  ELM  is  a  special  case.  (2)  Nonnormality 

generally  has  a  small  effect  on  tests  about  means  in  the  presence  of 

equal  groups  sampling,  zero  skewness,  and  zero  kurtosis.  In  the  RB 

K  4 

model,  each  e  has  zero  skewness  and  kurtosis  given  by  -2  Z  8./(  Y2  ^2)  * 

j-2  J  e 

which  is  clearly  dominated  by  the  term  in  the  denominator. 

As  noted  earlier,  to  determine  power  probabilities  Mauro  and  Smith 

(1982)  assumed  p  -  0  (i.e.,  independent  errors).  Under  this  assumption, 

the  individual  F-test  has  an  F-distribution  with  one  numerator  degree- 

of-freedom,  N-2  denominator  degrees~o f- freedom,  and  noncentrality  para- 
2 

meter  6  »  NBj/o2  where  a2  *  y2  +  a2  .  As  Arnold  (1981)  points  out, 

2 

the  correct  noncentrality  parameter  under  the  ELM  Is  <5  *  N8^/  [<J2(l-p)  ] . 
Mauro  and  Smith  (1982)  obtained  reasonably  good  power  approximations 
because  p  •  -y2/(N-1)(  Y2  +  o2  )  Is  generally  small.  It  Is  clear,  there¬ 
fore,  that  improved  power  probabilities  of  the  separate  F-test  in  the  RB 
model  can  be  obtained  by  using  the  factor  (1  -  p)  in  the  denominator  of 
the  noncentrality  parameter. 
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